Contextual Entropy and Reconstruction of Quantum States 
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We introduce a new notion of entropy for quantum states, called contextual entropy, and show 
how it unifies Shannon and von Neumann entropy. The main result is that from the knowledge of 
the contextual entropy of a quantum state of a finite-dimensional system, one can reconstruct the 
quantum state, i.e., the density matrix, if the Hilbert space is of dimension 3 or greater. We present 
an explicit algorithm for this state reconstruction and relate our result to Gleason's theorem. 

PACS numbers: 03.67.-a 



INTRODUCTION 

Quantum Information Theory has brought the impor- 
tance of information-theoretic concepts to the forefront 
of physics. Quantum systems have been shown to be 
able to perform information-theoretic tasks beyond the 
capabilities of classical systems, e.g. secure key commit- 
ment quantum teleportation [ij], factoring primes in 
polynomial time Q, and many others. Here, we will com- 
pare quantum systems to classical systems at the level of 
their associated information-theoretic notions, in partic- 
ular we consider entropy of physical states. For classical 
states, Shannon entropy Q is typically used, for quan- 
tum states, von Neumann entropy Q. We will show how 
a new notion of contextual entropy unifies these two. 

Let p be a quantum state. In the following, we will dis- 
tinguish between the (fixed and basis-independent) state 
p itself and the basis-dependent density matrix p rep- 
resenting it. The von Neumann entropy S(p) of p is 
the Shannon entropy of a specific probability distribu- 
tion, given by the diagonal elements of p in a basis in 
which p is diagonal. Picking such an orthonormal basis 
means choosing a particular measurement context. Yet, 
infinitely many other measurement contexts (in which p 
typically is not a diagonal matrix) are available and have 
a well-defined operational meaning. 

In this article, we show that it is fruitful to take all pos- 
sible measurement contexts into account and consider a 
family of Shannon entropies, one for each context. This 
leads to the new notion of the contextual entropy of 
a quantum state, which is a real-valued function from 
which the density matrix p and hence the state p can be 
reconstructed. This also provides an extension of Glea- 
son's theorem. 

DEFINITION OF CONTEXTUAL ENTROPY 

Consider a finite-dimensional quantum system with 
Hilbert space rl = C", for example a spin system, or 
a system of m qubits, in which case n — 2 m . Let 
(Pi,...Pfc) be a family of projection operators on rl 
such that PiPj = SijPi and F, = 1. Such a 

family is called a context (or sometimes a resolution of 
the identity) and describes a measurement with k out- 



comes. Conceptually, a measurement context is a 'clas- 
sical perspective' on the quantum system. A context 
C := (Pi, . . . Pfc) can be interpreted as the family of pro- 
jections onto the eigenspaces of an observable of the form 
A = ^2i=i a iP with k distinct real eigenvalues a^. The 
are not determined by the Pi (and their actual numerical 
values do not matter). 

Given a context C = (Pi, ... Pfc), we can obtain coarse- 
grained contexts by introducing degeneracy: for example, 
C = (Pi + Pa, P3, . . . , Pfc) is another context in which 
the outcomes 1 and 2 cannot be distinguished. We write 
C < C if each projection in C is either in C or is the 
sum of projections in C, and every projection in C shows 
up in such a sum exactly once. In this case we say that 
the context C is coarser (or more degenerate) than the 
context G. This gives a partial order on the set C of all 
contexts of our quantum system. 

A context C is called maximal if there are no con- 
texts C ^ C such that C < C. If C = (P u . . . ,P„) is 
maximal, then the n projections Pi arc of rank 1 and 
C corresponds to a non-degenerate measurement. Each 
rank-1 projection Pi determines a unit vector \vbi) such 
that \ijji)(il>i\ = Pi. This vector is unique up to a phase. 
Hence, with each maximal context we can associate an 
orthonormal basis (|"0i), . . . , |f/Vi}) of H that is unique up 
to phases. If C = (Pi,..., Pfc) with k < n is a non- 
maximal context, one can still find a (non- unique) or- 
thonormal basis of H such that all the Pi in C are given 
by diagonal matrices with respect to this basis, since all 
the projections in C commute. 

Not all contexts C, C £ C can be compared with re- 
spect to the order defined above. This corresponds to 
the fact that some measurements are incompatible. How- 
ever, incompatible contexts can be coarse-grained to the 
same context C. Consider for example the three spin 
operators Si in H = C 2 . Each Si defines a context Cj, 
given by the two projections onto the eigenspaces of Si. 
As is well-known, the Si do not commute, but they all 
commute with the total spin S 2 . The latter defines a 
context C that just contains the identity operator 1 and 
hence is coarser than C\ , Ci and C3 . This indicates that 
the partial order gives a rich structure to the set C of all 
contexts. For some results on the structure of C see @,0]. 
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Let p be a state of a finite-dimensional quantum sys- 
tem, described by a density matrix p. We will say that 
p is diagonal in a context C = (Pi, . . . ,-Pfe) if p is of the 
form p = J2i=i A*P»- This is justified by the following: 
if B = ( | -0x ),•■•, li'n)) is an orthonormal basis of % such 
that all projections Pi in C are given by diagonal matri- 
ces with respect to B, then p is diagonal with respect to 
B if and only if p = £\ A^p. 

Given a measurement with k outcomes described by 
a context C , the quantum state p assigns a probability 
Pi := Prob(z; p) = tr(pp) to each outcome i — l,...,k. 
This defines a probability distribution Pc ■= (pi, . ■ ■ ,Pk)- 
Since a context can be interpreted as a classical perspec- 
tive on a quantum system, it makes sense to assign a 
(classical) Shannon entropy to the probability distribu- 
tion Pc, which is H(Pc) = — E»=i Pi In Pi- Yet, no con- 
text C is preferred over the others, so we consider all 
contexts C € C. This gives a map 



E p :C 



C =(P 1 ,...,P k ) 



[0,lnn] 

H(P C ) = H(tv(pP{] 



MPPk))- 



Here, [0, In n] denotes the real interval from to In n. The 
map E p is called the contextual entropy of the quantum 
state p. It is a family of Shannon entropies, one for each 
context C E C. 

PROPERTIES OF CONTEXTUAL ENTROPY 

We show two important properties of contextual en- 
tropy. The first of these highlights the connection be- 
tween the von Neumann and the contextual entropy of a 
state and will be useful for our reconstruction algorithm. 
The second one shows that the contextual entropy is a 
monotone map. 

Von Neumann entropy from contextual en- 
tropy. We first show that there exists at least 
one maximal context C p G C such that E p (C p ) = 
S(p), the von Neumann entropy of the state p: let 
C p = (Pi,...,P n ) be a maximal context in which 
p is diagonal, that is, p = E"=i ^iP. Then 
p c p = (tr(pA), ■ ■ • ,tr(pP n )) = (Ai,...,A„) and hence 
E p (C p ) = H(P Cp ) = -Ei A, In A, = S(p), which is what 
we wanted to show. 

Secondly, recall that a vector r = (ri,...,r n ) of 
real numbers is majorised by another real vector s = 
(si,...,s n ) if Y!i=i T i = J2i=i s i an d> for au k < n, 



sf, where the r\ are the components of 



J- 



r rearranged in decreasing order, and similarly for the s 
1- 

Using the majorisation order, we can show a stronger 
result: the von Neumann entropy S(p) of a quantum 
state p is the minimal value of the contextual entropy 
Ep(C) when C is varying over maximal contexts. 

To see this, let C = (P\, . . . ,P n ) be any maximal con- 
text. Necessarily, all projections p S C are of rank 



1. Consider the matrix representation of p with re- 
spect to the orthonormal basis Be = (\ipi) , ■ ■ ■ , \ip n )) 
associated with the context C. This matrix has a sin- 
gle 1 in position (i, i) and zeros everywhere else. Then 
Pi = tr(pP) = tr(pP?) = tr(PpP) is the ith diago- 
nal element of the density matrix p when written with 
respect to the basis Be- Hence, the probability distribu- 
tion Pc = (pi, ■ ■ ■ ,p n ) in context C determined by the 
state p consists of the diagonal elements of the density 
matrix p when the latter is written with respect to the 
basis Be associated with C. 

The Schur-Horn theorem Q states that the vector 
1 = (Ai, . . . , A„) of eigenvalues of a Hcrmitian diagonal 
matrix M majoriscs the vector k = (k\, . . . , «„) of diag- 
onal elements of the Hcrmitian matrix UMU^ 1 obtained 
after a change of basis. In our situation, M = p, and a 
change of basis amounts to a change of maximal context. 
So, if Cp = (Qi, ■ ■ ■ , Q n ) is a maximal context in which p 
is diagonal and C is any other maximal context, then the 
vector Pc p = (Ai, . . . , A„) of eigenvalues of p majorises 
the vector Pc = (pi, . . . ,p n ) of diagonal elements of p 
written with respect to the basis associated with C . 

It is well-known that the Shannon entropy reverses the 
majorisation order, so it follows that the contextual en- 
tropy E p takes its minimal value on the set of maximal 
contexts of the form C p for which p is a diagonal ma- 
trix. We saw above that for such contexts C p , the value 
E p (Cp) of the contextual entropy is the von Neumann 
entropy S(p) of the quantum state. This completes the 
proof. 

Monotonicity. Now consider two contexts C,C S C 
such that C is coarser than C. Wc want to compare 
E p (C) and E p (C). Let C = (P 1 ,...,P k ), and let 



(Qh---,Q 



^2' ■ 



Qf ), where 



C = 

y^j—i Qj = Pi for all i = 1, . . . , k. Given a quantum 

state p, we have pi = tr(pp) = tr(pQ\) + . . . + tr(/5Q;.) 

for all i = 1, . . . , k. Let us denote q 1 - := tr(/5Qj), where 
i = 1, . . . , k and j = 1, . . . , Then, u sing the recursion 
property of Shannon entropy (see e.g. [l0(), 



E p (C)=H(ql,. 



k 




H(pi, . . . ,p k ) + > \p t H 



E P (C) 



Since the last term above is always non-negative, it fol- 
lows that E p (C) < E p (C) for all contexts C < C. 
Hence, the map E p : C — »• [0,lnn] is indeed order- 
preserving. 

RECONSTRUCTION OF QUANTUM STATES 

We now show that the contextual entropy of a quan- 
tum state p contains enough information to uniquely re- 
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construct the density matrix p if the dimension of the 
Hilbcrt space is at least 3. We assume that we can iden- 
tify a maximal context C p for which E p takes its mini- 
mal value among all maximal contexts. (In general, the 
minimal value of E p will be attained in many different 
maximal contexts, but any of them will do.) We remark 
in passing that the same reconstruction algorithm would 
also work for contextual Renyi entropies. 

Pure states. Assume that p is pure, i.e., p = \4>){'4>\ 
for some unit vector G "H. We want to determine 
IV'XV'I from the contextual entropy E p . 

We first note that a state p is pure if and only if 
Ep{C p ) = 0. This follows from the previous section, 
where we showed that E p (C p ) = S(p), the von Neumann 
entropy of p. The latter is equal to if and only if p is a 
pure state. 

Let C = (Pi, . . . , P n ) be a context for which E P (C) = 
0. Then there is a unique p £ C such that (ip\ = P ■ 
Using finitely many values of the contextual entropy E p , 
it is possible to determine which of the n projections 
Pi,...,P n equals l^)^. For this, consider n unitary 
matrices Ui, . . . , U n such that ^PjC/. = P\ and, for all 
1 < j < n, j ^ i, 

That is, Ui leaves the projection Pj invariant and 'rotates' 
the other projections Pj, j ^ i without resulting in a 
permutation of any of them. Such unitaries always exist 
if dim"H > 3: for example, a unitary keeping Pi fixed 
and rotating all other Pj has a matrix of the form 



1 


T 




U' 



where the unitary U' G U(n — 1) can be taken to be 
a rotation by some small angle around an axis different 
from all the n — 1 coordinate directions for j = 2, . . . , n. 

Consider the maximal contexts of the form Cj := 
(U.PiU- 1 , U.PnU- 1 ) for i = 1, . . . , n. Only the con- 
text Ci Q contains the projection Pi = \ip)(ip\, which is 
the state that we are looking for. Hence, the density ma- 
trix p is only diagonal with respect to the orthonormal 
basis associated with the maximal context Cj , while it 
is not diagonal with respect to the bases associated with 
any of the contexts Cj, j ^= Iq. If the density matrix 
p of a projection \ip)(ip\ is not diagonal with respect to 
some basis, then there are at least two non-zero entries on 
the diagonal, so the probability distribution (pi, . . . ,p n ) 
given by the diagonal elements of p has Shannon entropy 
strictly larger than 0. 

Hence, E p (C ia ) = 0, while E p (Cj) > for all j ^ i - 
In this way, we can identify i$ and can determine the 
state \ip)(ip\. 

Mixed states. We now show how to reconstruct p 
from E p when p is a mixed state and dimH > 3. 



Step 1. Find a maximal context C p = (P\,...,P n ) 
for which the value E p (C p ) is minimal (but larger than 
since p is mixed). The Schur-Horn theorem implies that 
the state which we want to determine is diagonal in C p , 
that is, p = Y^i=i ^iPi- We have to find the eigenvalues 
Aj of p. 

Step 2. Let Cj := (P,l — Pj) be the context con- 
taining just Pj and its complement, for each i = 1, . . . , n. 
Then E p (Ci) is the Shannon entropy of the probability 
distribution (ti(pPi), 1 — tr(/5Pj)). 

Of course, the actual values Xi := tv(pPi) and 1 — Aj = 
1 — tr(pPj)) are unknown so far. We just have the binary 
entropy 

E p (Ci) = -Xi teiXi - (1 - Xi) ln(l - Xi). 

Solving this, we obtain two solutions, Ci and 1 — Cj. We 
can assume without loss of generality that Cj < i < 1 — Cj . 
For each i, the eigenvalue Xi of the density matrix p is 
either a or 1 — a . 

Step 3. We write the solutions c\ . . . , c n and 1 — 
Ci, . . . , 1 — Cn in two rows such that 1 — Cj is beneath 
Ci. In order to find the eigenvalues Xi of the density ma- 
trix p = Xip, written with respect to the basis 
associated with C p , we must choose n numbers from this 
table, one from each column, such that their sum is equal 
to 1 (since Y17=i ^ ~ At least one such solution must 
exist, since we assumed that E p is the contextual entropy 
of some quantum state p. 

(a) If the numbers c,; in the top row add up to 1, then 
we have found the unique solution: A.j := Cj for each 
i = 1, ... ,n. Picking any 1 — Cj instead of c, would make 
the total sum greater than 1. 

(b) If the sum S := Yn=i Ci °^ * ne elements in the top 
row is smaller than 1, then we must replace at least one 
value Cj from the top row with 1 — Cj from the bottom 
row. Note, however, that since the entries of the bottom 
row are all greater or equal to \ , we can only pick exactly 
one such entry, since picking two or more would make the 
sum of our n chosen elements greater than 1. 

To determine which Cj (from the top row) must be 
replaced by 1 — Cj (from the bottom row) among the n 
elements we pick, we note the following: the sum ci + . . .+ 
Cj_i + (1 — Cj ) + Cj+i + . . . + c„ must equal 1 , which implies 
that Cj must have the value c := ( Cl +—+ C ") = S < l 

(bl) If the value c appears only once among the entries 
of the top row, for example in the jth column, then our 
unique solution is X 3 ■ = 1 — Cj and Xi = Cj for all i =/= j. 

(b2) If, however, the value c appears twice among the 
entries of the top row, the bottom row has 1 — c in the 
corresponding two columns. This implies that the top 
row has 0s in all other columns, since otherwise the sum 
of 1 — c (the unique entry picked from the bottom row) 
and the sum of the n — 1 entries from the other colums of 
the top row would be larger than 1. This also implies that 
c cannot appear three or more times among the entries 
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of the top row. Assume that the two entries of c appear 
in the jth and fcth columns. Our state is then cither 



or 



P = cPj + (1 - c)P k 
p = cP k + {l-c)P j . 



(1) 
(2) 



In order to determine which of these is the correct solu- 
tion, consider a unitary U which rotates, but does not 
permute, all the projections Pi,...,P n except for Pj, 
which it leaves unchanged. Here, we need dim'H > 3. 
The jth eigenvalues of U~ 1 pU and p (that is, the eigen- 
values for the joint eigenvector determined by Pj) coin- 
cide and are equal to Xj , while the other eigenvalues are 
distinct in general. We consider the contexts of the form 
Wi = (UPiU^ 1 , 1 - UPiU^ 1 ) and solve the equations 

E p (Wi) = -di Indj - (1 - dj) ln(l - di). 

We write the solutions in a second table, again with the 
convention that the top row contains entries smaller than 
4. We then repeat the procedure detailed above (from 
Step 3 onwards) of choosing n numbers adding up to 

1, this time from the second table. These numbers will 
be equal to the diagonal entries of the matrix U~ x pU , 
written in the basis in which p is diagonal. 

Because of our choice of unitary, the diagonal entries 
of the density matrix \J~ x pU will contain at least three 
non-zero elements, so we do not enter the (b2) branch of 
our algorithm again, since the top row of the second tabic 
will also contain at least three non-zero entries. Hence, 
this time there will be a unique choice of n entries adding 
up to 1. In particular, the jth clement of this solution 
equals the jth diagonal entry of the matrix U^ 1 j5U 1 which 
is the same as Xj, the jth eigenvalue of p. This allows us 
to choose the correct quantum state from the two possible 
solutions |T]) and @ . We finally remark that for dim T-L = 

2, the qubit case, we can determine the state up to the 
ambiguity between (JT|) and ©. 

RELATION TO GLEASON'S THEOREM 

Let T-L = C", n > 3, and let /i be a (finitely additive) 
probability measure p. : V(H) — > [0, 1] on the projections 
of H, that is p(l) = 1 and if PQ = QP = 6, then 
p(P + Q)=p(P)+p(d). 

Gleason's theorem (which we only consider for the 
finite-dimensional case here) states that for every such 
probability measure fi, there exists a quantum state p p 
such that, for all projections P € V(H), 

tr(^P) = fx(P). (3) 

Conversely, every quantum state p gives a probability 
measure p p : P(TL) — > [0,1] simply by setting p p {P) := 
tr(pP) for all P G T{U). 

A probability measure p defines a probability distri- 
bution Pc = (n(Pi), . . . ,/x(Pfe)) for each context C = 
(Pi, . . . ,Pfc) . Gleason's theorem shows that if wc have a 



family (Pc)cec of probability distributions, one for each 
context, that come from a measure p, then there is a 
unique quantum state p p such that eq. © holds. 

Given p, we can construct the corresponding contex- 
tual entropy E p : to each context C = (Pi, . . . ,Pfc), we 
assign the Shannon entropy of the probability distribu- 
tion Pc- As was shown in the previous section, one can 
reconstruct the quantum state p p from its contextual en- 
tropy Eft. This requires a single real number E p (C) for 
each context C = (Pi,...,Pfe) instead of the k num- 
bers p(Pi), ■ ■ ■ ,p,(Pk)- Moreover, we obtain an explicit 
density matrix from our reconstruction, while Gleason's 
theorem merely shows that a density matrix must exist. 
In this sense, our approach via the contextual entropy is 
an extension of Gleason's result. 

On the other hand, Gleason's theorem guarantees that 
every probability measure p : V{T-L) — >• [0, 1] corresponds 
to a quantum state, while we had to assume that the 
map E p : C — ► [0,lnn] which wc use in our reconstruction 
actually is the contextual entropy of some quantum state. 

SUMMARY AND OUTLOOK 

Given a quantum state p on a finite-dimensional 
Hilbcrt space H and a measurement context C = 
(Pi, . . . , P n ), we can extract the probability distribution 
Pc = (tr(pPi), . . . ,tr(pP n )) by repeated preparations 
and measurements. In contrast to the quantum state it- 
self, measurement contexts have direct operational mean- 
ing. The contextual entropy E p : C — >• [0,lnn], which is 
a real-valued function, assigns to each probability distri- 
bution Pc its Shannon entropy and hence encodes data 
that can be extracted operationally from the quantum 
state p. 

The fact that the state p can be reconstructed from 
its contextual entropy E p if dim % > 3 provides a new, 
information-theoretic characterisation of quantum states 
that takes contextuality into account explicitly. The re- 
sults in this article connect directly with the so-called 
topos approach to quantum theory T^i HI in which con- 
textuality is a key concept. In future work, we will de- 
velop these connections in depth and will also consider 
infinite-dimensional systems. 

We presented a number of properties of contextual en- 
tropy and discussed how the reconstruction of a quantum 
state from its contextual entropy relates to Gleason's the- 
orem. As matters stand, the properties we presented 
do not characterise contextual entropy fully: there are 
functions F : C —> [0, Inn] that have all the properties 
we discussed, but are not the contextual entropy of any 
quantum state. 

Finding an axiomatic characterisation of exactly those 
functions which are contextual entropies of quantum 
states would, together with our reconstruction algorithm, 
provide an alternative proof of Gleason's theorem. This 
is an interesting and non-trivial open problem. 
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